To what extent does case contribute to verb sense disambiguation?
Authors
Abstract
Word sense disambiguation has recently been utilized in corpus-based approaches, reflecting the growth in the number of machine readable texts. One category of approaches disambiguates an input verb sense based on the similarity between its governing case fillers and those in given examples. In this paper, we introduce the degree of [...] based approach in explaining its effectivity for verb sense disambiguation in Japanese. A representative example-based method for verb sense disambiguation was proposed by Kurohashi and Nagao (Kurohashi's method) (Kurohashi and Nagao, 1994). Their method uses an example database, containing examples of collocations as in figure 1. Figure 1 shows a fragment of the entry associated with the Japanese verb toru. As with most words, the verb toru has multiple senses, examples of which are "to take/steal," "to attain," "to subscribe" and "to reserve." The database gives one or more case frame(s) associated with the verbs for each of their senses. In Japanese, a complement of a verb, which is a constituent of the case frame of the verb, consists of a noun phrase (case filler) followed by a case marker such as ga (nominative) or o (accusative). The database has an example set of case fillers for each case. As shown in figure 1, examples of a complement can be considered as an extensional description of the selectional restriction on it. The task considered in this paper is "to interpret" a verb in an input sentence, i.e. to choose one sense from a set of candidate senses of the verb. Given an input sentence, Kurohashi's method interprets the verb in the input by computing semantic similarity between the input and examples. For this computation, Kurohashi's method experimentally uses the Japanese word thesaurus Bunruigoihyo (National-Language Research Institute, 1964).
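The example-based interpretation step described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the implementation of Kurohashi and Nagao: the thesaurus paths, the example database `TORU`, the function names, and the scoring rule (sum, over the input's cases, of the best similarity to any stored example filler) are all hypothetical, and similarity is approximated here by the number of category labels two nouns share from the root of a toy hierarchy.

```python
from typing import Dict, List, Tuple

# Hypothetical toy thesaurus: noun -> path of category labels from the root.
# These paths are illustrative only and are NOT taken from Bunruigoihyo.
THESAURUS: Dict[str, Tuple[str, ...]] = {
    "hisho": ("human", "worker", "clerical"),        # secretary
    "joshu": ("human", "worker", "clerical"),        # assistant
    "suri": ("human", "criminal", "thief"),          # pickpocket
    "chichi": ("human", "family", "parent"),         # father
    "shindaisha": ("artifact", "vehicle", "rail"),   # sleeping car
    "hikoki": ("artifact", "vehicle", "air"),        # airplane
    "saifu": ("artifact", "container", "purse"),     # wallet
    "shinbun": ("artifact", "publication", "news"),  # newspaper
}

# Hypothetical example database for one verb:
# sense -> case marker -> example case fillers.
CaseFrame = Dict[str, List[str]]
TORU: Dict[str, CaseFrame] = {
    "to take/steal": {"ga": ["suri"], "o": ["saifu"]},
    "to subscribe": {"ga": ["chichi"], "o": ["shinbun"]},
    "to reserve": {"ga": ["joshu"], "o": ["hikoki"]},
}


def similarity(a: str, b: str) -> int:
    """Count the category labels shared from the root (assumed measure)."""
    path_a, path_b = THESAURUS.get(a), THESAURUS.get(b)
    if path_a is None or path_b is None:
        return 0  # unknown nouns contribute nothing
    shared = 0
    for label_a, label_b in zip(path_a, path_b):
        if label_a != label_b:
            break
        shared += 1
    return shared


def sense_score(frame: CaseFrame, input_fillers: Dict[str, str]) -> int:
    """Sum, over the input's cases, the best similarity to any example."""
    return sum(
        max((similarity(filler, ex) for ex in frame.get(case, [])), default=0)
        for case, filler in input_fillers.items()
    )


def interpret(entry: Dict[str, CaseFrame], input_fillers: Dict[str, str]) -> str:
    """Choose the candidate sense whose examples best match the input."""
    return max(entry, key=lambda sense: sense_score(entry[sense], input_fillers))


if __name__ == "__main__":
    # Input: a secretary (ga, nominative) and a sleeping car (o, accusative).
    print(interpret(TORU, {"ga": "hisho", "o": "shindaisha"}))
```

Note that every case contributes to the score with equal weight in this sketch; that equal weighting is a simplification made here for illustration.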
As with most thesauruses, the length of the path between two words in Bunruigoihyo is expected to reflect the similarity between them. Figure 2 illustrates a fragment of Bunruigoihyo including some of the nouns in figure 1. Let us take the example sentence (1).
(1) hisho ga shindaisha o toru.
    (secretary-NOM) (sleeping car-ACC) (?)
In this example, it may be judged according to figure 2 that hisho ("secretary") and shindaisha ("sleeping car") in (1) are semantically similar to joshu ("assistant") and hikōki ("airplane"), respectively, which are examples that collocate with toru ("to reserve"). As such, the sense of toru in (1) can be interpreted as "to reserve." However, in Kurohashi's method, several useful properties for verb disambiguation are missing: 1. Intuitively speaking, the contribution of the
Similar resources
Automatic Sense Disambiguation for Target Word Selection
This paper describes a method of automatic sense disambiguation for target word selection in Korean to English machine translation. At first, we define the concept of cluster for each sense of given verb according to corresponding target word. And then, we propose a method which selects the sense combination of words as the correct sense that has the greatest number of overlaps between input ca...
Improving Japanese Zero Pronoun Resolution by Global Word Sense Disambiguation
This paper proposes unsupervised word sense disambiguation based on automatically constructed case frames and its incorporation into our zero pronoun resolution system. The word sense disambiguation is applied to verbs and nouns. We consider that case frames define verb senses and semantic features in a thesaurus define noun senses, respectively, and perform sense disambiguation by selecting th...
Argument Status in Japanese Verb Sense Disambiguation
This research aims to incorporate argument status-based modelling within an otherwise selectional constraint-based system of verb sense disambiguation, to capture effects such as underspecification, surface case alternation and semantic backingoff. The proposed implementation hinges around a description of the general behavioural characteristics of integral complements, complements, middles and...
DFKI: Multi-objective Optimization for the Joint Disambiguation of Entities and Nouns & Deep Verb Sense Disambiguation
We introduce an approach to word sense disambiguation and entity linking that combines a set of complementary objectives in an extensible multi-objective formalism. During disambiguation the system performs continuous optimization to find optimal probability distributions over candidate senses. Verb senses are disambiguated using a separate neural network model. Our results on noun and verb sen...
Corpus-Based Word Sense Disambiguation
[Figure 3.4: The hierarchy of IPAL semantic features] In this example, one may consider hisho ("secretary") and shindaisha ("sleeping car") to be semantically similar to joshu ("assistant") and hikouki ("airplane") respectively, and since both collocate with the "to r...